Dataset statistics
| Number of variables | 27 |
|---|---|
| Number of observations | 296 |
| Missing cells | 409 |
| Missing cells (%) | 5.1% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 75.3 KiB |
| Average record size in memory | 260.4 B |
Variable types
| NUM | 15 |
|---|---|
| BOOL | 8 |
| CAT | 3 |
| DATE | 1 |
Reproduction
| Analysis started | 2020-05-05 17:16:30.918427 |
|---|---|
| Analysis finished | 2020-05-05 17:17:04.902150 |
| Version | pandas-profiling v2.5.0 |
| Command line | pandas_profiling --config_file config.yaml [YOUR_FILE.csv] |
| Download configuration | config.yaml |
month is highly correlated with quarter and 1 other fields | High Correlation |
quarter is highly correlated with month and 1 other fields | High Correlation |
weekofyear is highly correlated with quarter and 1 other fields | High Correlation |
meanwd_udsprevisionempresa is highly correlated with meanwd_udsventa | High Correlation |
meanwd_udsventa is highly correlated with meanwd_udsprevisionempresa | High Correlation |
udsstock has 93 (31.4%) missing values | Missing |
udsventa has 61 (20.6%) missing values | Missing |
udsprevisionempresa has 82 (27.7%) missing values | Missing |
roll4wd_udsventa has 50 (16.9%) missing values | Missing |
meanwd_udsventa has 42 (14.2%) missing values | Missing |
roll4wd_udsstock has 16 (5.4%) missing values | Missing |
roll4wd_udsprevisionempresa has 65 (22.0%) missing values | Missing |
weekday has 42 (14.2%) zeros | Zeros |
sin_weekday has 42 (14.2%) zeros | Zeros |
| Distinct count | 296 |
|---|---|
| Unique (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 10655.0 |
|---|---|
| Minimum | 35 |
| Maximum | 21275 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 2.4 KiB |
Quantile statistics
| Minimum | 35 |
|---|---|
| 5-th percentile | 1097 |
| Q1 | 5345 |
| median | 10655 |
| Q3 | 15965 |
| 95-th percentile | 20213 |
| Maximum | 21275 |
| Range | 21240 |
| Interquartile range (IQR) | 10620 |
Descriptive statistics
| Standard deviation | 6162.628011 |
|---|---|
| Coefficient of variation (CV) | 0.578378978 |
| Kurtosis | -1.2 |
| Mean | 10655 |
| Median Absolute Deviation (MAD) | 5328 |
| Skewness | 0 |
| Sum | 3153880 |
| Variance | 37977984 |
| Value | Count | Frequency (%) | |
| 14723 | 1 | 0.3% | |
| 4931 | 1 | 0.3% | |
| 19259 | 1 | 0.3% | |
| 19187 | 1 | 0.3% | |
| 4283 | 1 | 0.3% | |
| 13571 | 1 | 0.3% | |
| 13715 | 1 | 0.3% | |
| 9251 | 1 | 0.3% | |
| 12275 | 1 | 0.3% | |
| 9395 | 1 | 0.3% | |
| Other values (286) | 286 | 96.6% |
| Value | Count | Frequency (%) | |
| 35 | 1 | 0.3% | |
| 107 | 1 | 0.3% | |
| 179 | 1 | 0.3% | |
| 251 | 1 | 0.3% | |
| 323 | 1 | 0.3% |
| Value | Count | Frequency (%) | |
| 21275 | 1 | 0.3% | |
| 21203 | 1 | 0.3% | |
| 21131 | 1 | 0.3% | |
| 21059 | 1 | 0.3% | |
| 20987 | 1 | 0.3% |
| Distinct count | 296 |
|---|---|
| Unique (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 2.4 KiB |
| Minimum | 2019-06-05 00:00:00 |
|---|---|
| Maximum | 2020-03-26 00:00:00 |
| Distinct count | 1 |
|---|---|
| Unique (%) | 0.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 2.4 KiB |
| 48 |
|---|
| Value | Count | Frequency (%) | |
| 48 | 296 | 100.0% |
Length
| Max length | 2 |
|---|---|
| Mean length | 2 |
| Min length | 2 |
| Value | Count | Frequency (%) | |
| Decimal_Number | 2 | 100.0% |
| Value | Count | Frequency (%) | |
| Common | 2 | 100.0% |
| Value | Count | Frequency (%) | |
| ASCII | 2 | 100.0% |
| Distinct count | 106 |
|---|---|
| Unique (%) | 52.2% |
| Missing | 93 |
| Missing (%) | 31.4% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 919.7192118226601 |
|---|---|
| Minimum | 48.0 |
| Maximum | 2316.0 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 2.4 KiB |
Quantile statistics
| Minimum | 48 |
|---|---|
| 5-th percentile | 369.9 |
| Q1 | 615 |
| median | 882 |
| Q3 | 1153 |
| 95-th percentile | 1560 |
| Maximum | 2316 |
| Range | 2268 |
| Interquartile range (IQR) | 538 |
Descriptive statistics
| Standard deviation | 404.4809227 |
|---|---|
| Coefficient of variation (CV) | 0.4397874019 |
| Kurtosis | 0.8703311871 |
| Mean | 919.7192118 |
| Median Absolute Deviation (MAD) | 316.0531437 |
| Skewness | 0.7004239733 |
| Sum | 186703 |
| Variance | 163604.8168 |
| Value | Count | Frequency (%) | |
| 416 | 6 | 2.0% | |
| 794 | 6 | 2.0% | |
| 552 | 5 | 1.7% | |
| 513 | 5 | 1.7% | |
| 1133 | 4 | 1.4% | |
| 591 | 4 | 1.4% | |
| 1463 | 4 | 1.4% | |
| 814 | 4 | 1.4% | |
| 940 | 4 | 1.4% | |
| 649 | 4 | 1.4% | |
| Other values (96) | 157 | 53.0% | |
| (Missing) | 93 | 31.4% |
| Value | Count | Frequency (%) | |
| 48 | 1 | 0.3% | |
| 82 | 1 | 0.3% | |
| 118 | 1 | 0.3% | |
| 140 | 1 | 0.3% | |
| 290 | 2 | 0.7% |
| Value | Count | Frequency (%) | |
| 2316 | 1 | 0.3% | |
| 2190 | 1 | 0.3% | |
| 2132 | 1 | 0.3% | |
| 1996 | 2 | 0.7% | |
| 1918 | 2 | 0.7% |
| Distinct count | 101 |
|---|---|
| Unique (%) | 43.0% |
| Missing | 61 |
| Missing (%) | 20.6% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 604.4510638297872 |
|---|---|
| Minimum | 73.0 |
| Maximum | 1719.0 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 2.4 KiB |
Quantile statistics
| Minimum | 73 |
|---|---|
| 5-th percentile | 262.9 |
| Q1 | 435 |
| median | 590 |
| Q3 | 730 |
| 95-th percentile | 917.1 |
| Maximum | 1719 |
| Range | 1646 |
| Interquartile range (IQR) | 295 |
Descriptive statistics
| Standard deviation | 246.2510375 |
|---|---|
| Coefficient of variation (CV) | 0.407396152 |
| Kurtosis | 4.41123923 |
| Mean | 604.4510638 |
| Median Absolute Deviation (MAD) | 181.3457673 |
| Skewness | 1.343387486 |
| Sum | 142046 |
| Variance | 60639.57345 |
| Value | Count | Frequency (%) | |
| 560 | 7 | 2.4% | |
| 435 | 6 | 2.0% | |
| 494 | 6 | 2.0% | |
| 376 | 5 | 1.7% | |
| 678 | 5 | 1.7% | |
| 642 | 5 | 1.7% | |
| 442 | 5 | 1.7% | |
| 649 | 5 | 1.7% | |
| 605 | 5 | 1.7% | |
| 568 | 5 | 1.7% | |
| Other values (91) | 181 | 61.1% | |
| (Missing) | 61 | 20.6% |
| Value | Count | Frequency (%) | |
| 73 | 1 | 0.3% | |
| 154 | 1 | 0.3% | |
| 199 | 1 | 0.3% | |
| 221 | 1 | 0.3% | |
| 228 | 2 | 0.7% |
| Value | Count | Frequency (%) | |
| 1719 | 1 | 0.3% | |
| 1690 | 1 | 0.3% | |
| 1586 | 1 | 0.3% | |
| 1542 | 2 | 0.7% | |
| 1084 | 1 | 0.3% |
| Distinct count | 200 |
|---|---|
| Unique (%) | 93.5% |
| Missing | 82 |
| Missing (%) | 27.7% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3028.4345794392525 |
|---|---|
| Minimum | 51.0 |
| Maximum | 15296.0 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 2.4 KiB |
Quantile statistics
| Minimum | 51 |
|---|---|
| 5-th percentile | 424.45 |
| Q1 | 1476 |
| median | 2613 |
| Q3 | 3956.5 |
| 95-th percentile | 7014.1 |
| Maximum | 15296 |
| Range | 15245 |
| Interquartile range (IQR) | 2480.5 |
Descriptive statistics
| Standard deviation | 2271.652463 |
|---|---|
| Coefficient of variation (CV) | 0.7501078209 |
| Kurtosis | 5.250003237 |
| Mean | 3028.434579 |
| Median Absolute Deviation (MAD) | 1668.472749 |
| Skewness | 1.742875143 |
| Sum | 648085 |
| Variance | 5160404.914 |
| Value | Count | Frequency (%) | |
| 4535 | 3 | 1.0% | |
| 3306 | 2 | 0.7% | |
| 97 | 2 | 0.7% | |
| 3719 | 2 | 0.7% | |
| 227 | 2 | 0.7% | |
| 3511 | 2 | 0.7% | |
| 2095 | 2 | 0.7% | |
| 1476 | 2 | 0.7% | |
| 2355 | 2 | 0.7% | |
| 2833 | 2 | 0.7% | |
| Other values (190) | 193 | 65.2% | |
| (Missing) | 82 | 27.7% |
| Value | Count | Frequency (%) | |
| 51 | 1 | 0.3% | |
| 95 | 1 | 0.3% | |
| 97 | 2 | 0.7% | |
| 162 | 1 | 0.3% | |
| 203 | 1 | 0.3% |
| Value | Count | Frequency (%) | |
| 15296 | 1 | 0.3% | |
| 12505 | 1 | 0.3% | |
| 11678 | 1 | 0.3% | |
| 9470 | 1 | 0.3% | |
| 8879 | 1 | 0.3% |
| Distinct count | 1 |
|---|---|
| Unique (%) | 0.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 2.4 KiB |
| 0 |
|---|
| Value | Count | Frequency (%) | |
| 0 | 296 | 100.0% |
festivo
Boolean
| Distinct count | 2 |
|---|---|
| Unique (%) | 0.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 2.4 KiB |
| 0 | |
|---|---|
| 1 | 8 |
| Value | Count | Frequency (%) | |
| 0 | 288 | 97.3% | |
| 1 | 8 | 2.7% |
| Distinct count | 7 |
|---|---|
| Unique (%) | 2.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2.9966216216216215 |
|---|---|
| Minimum | 0 |
| Maximum | 6 |
| Zeros | 42 |
| Zeros (%) | 14.2% |
| Memory size | 2.4 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 1 |
| median | 3 |
| Q3 | 5 |
| 95-th percentile | 6 |
| Maximum | 6 |
| Range | 6 |
| Interquartile range (IQR) | 4 |
Descriptive statistics
| Standard deviation | 1.997453142 |
|---|---|
| Coefficient of variation (CV) | 0.6665683542 |
| Kurtosis | -1.241520413 |
| Mean | 2.996621622 |
| Median Absolute Deviation (MAD) | 1.706560446 |
| Skewness | 0.004680305814 |
| Sum | 887 |
| Variance | 3.989819056 |
| Value | Count | Frequency (%) | |
| 3 | 43 | 14.5% | |
| 2 | 43 | 14.5% | |
| 6 | 42 | 14.2% | |
| 5 | 42 | 14.2% | |
| 4 | 42 | 14.2% | |
| 1 | 42 | 14.2% | |
| 0 | 42 | 14.2% |
| Value | Count | Frequency (%) | |
| 0 | 42 | 14.2% | |
| 1 | 42 | 14.2% | |
| 2 | 43 | 14.5% | |
| 3 | 43 | 14.5% | |
| 4 | 42 | 14.2% |
| Value | Count | Frequency (%) | |
| 6 | 42 | 14.2% | |
| 5 | 42 | 14.2% | |
| 4 | 42 | 14.2% | |
| 3 | 43 | 14.5% | |
| 2 | 43 | 14.5% |
| Distinct count | 4 |
|---|---|
| Unique (%) | 1.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 2.4 KiB |
| 4 | |
|---|---|
| 3 | |
| 1 | |
| 2 |
| Value | Count | Frequency (%) | |
| 4 | 92 | 31.1% | |
| 3 | 92 | 31.1% | |
| 1 | 86 | 29.1% | |
| 2 | 26 | 8.8% |
Length
| Max length | 1 |
|---|---|
| Mean length | 1 |
| Min length | 1 |
| Value | Count | Frequency (%) | |
| Decimal_Number | 4 | 100.0% |
| Value | Count | Frequency (%) | |
| Common | 4 | 100.0% |
| Value | Count | Frequency (%) | |
| ASCII | 4 | 100.0% |
| Distinct count | 10 |
|---|---|
| Unique (%) | 3.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 6.993243243243243 |
|---|---|
| Minimum | 1 |
| Maximum | 12 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 2.4 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 3 |
| median | 8 |
| Q3 | 10 |
| 95-th percentile | 12 |
| Maximum | 12 |
| Range | 11 |
| Interquartile range (IQR) | 7 |
Descriptive statistics
| Standard deviation | 3.667533456 |
|---|---|
| Coefficient of variation (CV) | 0.5244395666 |
| Kurtosis | -1.215710455 |
| Mean | 6.993243243 |
| Median Absolute Deviation (MAD) | 3.109751644 |
| Skewness | -0.3478227975 |
| Sum | 2070 |
| Variance | 13.45080165 |
| Value | Count | Frequency (%) | |
| 12 | 31 | 10.5% | |
| 10 | 31 | 10.5% | |
| 8 | 31 | 10.5% | |
| 7 | 31 | 10.5% | |
| 1 | 31 | 10.5% | |
| 11 | 30 | 10.1% | |
| 9 | 30 | 10.1% | |
| 2 | 29 | 9.8% | |
| 6 | 26 | 8.8% | |
| 3 | 26 | 8.8% |
| Value | Count | Frequency (%) | |
| 1 | 31 | 10.5% | |
| 2 | 29 | 9.8% | |
| 3 | 26 | 8.8% | |
| 6 | 26 | 8.8% | |
| 7 | 31 | 10.5% |
| Value | Count | Frequency (%) | |
| 12 | 31 | 10.5% | |
| 11 | 30 | 10.1% | |
| 10 | 31 | 10.5% | |
| 9 | 30 | 10.1% | |
| 8 | 31 | 10.5% |
| Distinct count | 43 |
|---|---|
| Unique (%) | 14.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 28.469594594594593 |
|---|---|
| Minimum | 1 |
| Maximum | 52 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 2.4 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 3 |
| Q1 | 11 |
| median | 31 |
| Q3 | 42 |
| 95-th percentile | 50 |
| Maximum | 52 |
| Range | 51 |
| Interquartile range (IQR) | 31 |
Descriptive statistics
| Standard deviation | 15.97664889 |
|---|---|
| Coefficient of variation (CV) | 0.561182873 |
| Kurtosis | -1.229228509 |
| Mean | 28.46959459 |
| Median Absolute Deviation (MAD) | 13.65613587 |
| Skewness | -0.3266565044 |
| Sum | 8427 |
| Variance | 255.2533097 |
| Value | Count | Frequency (%) | |
| 52 | 7 | 2.4% | |
| 51 | 7 | 2.4% | |
| 29 | 7 | 2.4% | |
| 28 | 7 | 2.4% | |
| 27 | 7 | 2.4% | |
| 26 | 7 | 2.4% | |
| 25 | 7 | 2.4% | |
| 24 | 7 | 2.4% | |
| 12 | 7 | 2.4% | |
| 11 | 7 | 2.4% | |
| Other values (33) | 226 | 76.4% |
| Value | Count | Frequency (%) | |
| 1 | 7 | 2.4% | |
| 2 | 7 | 2.4% | |
| 3 | 7 | 2.4% | |
| 4 | 7 | 2.4% | |
| 5 | 7 | 2.4% |
| Value | Count | Frequency (%) | |
| 52 | 7 | 2.4% | |
| 51 | 7 | 2.4% | |
| 50 | 7 | 2.4% | |
| 49 | 7 | 2.4% | |
| 48 | 7 | 2.4% |
working_day
Boolean
| Distinct count | 2 |
|---|---|
| Unique (%) | 0.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 424.0 B |
| True | |
|---|---|
| False |
| Value | Count | Frequency (%) | |
| True | 246 | 83.1% | |
| False | 50 | 16.9% |
| Distinct count | 7 |
|---|---|
| Unique (%) | 2.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.004759498821957385 |
|---|---|
| Minimum | -0.9749279121818236 |
| Maximum | 0.9749279121818236 |
| Zeros | 42 |
| Zeros (%) | 14.2% |
| Memory size | 2.4 KiB |
Quantile statistics
| Minimum | -0.9749279122 |
|---|---|
| 5-th percentile | -0.9749279122 |
| Q1 | -0.7818314825 |
| median | 0 |
| Q3 | 0.7818314825 |
| 95-th percentile | 0.9749279122 |
| Maximum | 0.9749279122 |
| Range | 1.949855824 |
| Interquartile range (IQR) | 1.563662965 |
Descriptive statistics
| Standard deviation | 0.7086201304 |
|---|---|
| Coefficient of variation (CV) | 148.8854514 |
| Kurtosis | -1.50521649 |
| Mean | 0.004759498822 |
| Median Absolute Deviation (MAD) | 0.6270716718 |
| Skewness | -0.0106157593 |
| Sum | 1.408811651 |
| Variance | 0.5021424891 |
| Value | Count | Frequency (%) | |
| 0.4338837391 | 43 | 14.5% | |
| 0.9749279122 | 43 | 14.5% | |
| -0.4338837391 | 42 | 14.2% | |
| -0.9749279122 | 42 | 14.2% | |
| -0.7818314825 | 42 | 14.2% | |
| 0.7818314825 | 42 | 14.2% | |
| 0 | 42 | 14.2% |
| Value | Count | Frequency (%) | |
| -0.9749279122 | 42 | 14.2% | |
| -0.7818314825 | 42 | 14.2% | |
| -0.4338837391 | 42 | 14.2% | |
| 0 | 42 | 14.2% | |
| 0.4338837391 | 43 | 14.5% |
| Value | Count | Frequency (%) | |
| 0.9749279122 | 43 | 14.5% | |
| 0.7818314825 | 42 | 14.2% | |
| 0.4338837391 | 43 | 14.5% | |
| 0 | 42 | 14.2% | |
| -0.4338837391 | 42 | 14.2% |
cos_weekday
Real number (ℝ)
| Distinct count | 7 |
|---|---|
| Unique (%) | 2.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | -0.0037955736549281846 |
|---|---|
| Minimum | -0.9009688679024191 |
| Maximum | 1.0 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 2.4 KiB |
Quantile statistics
| Minimum | -0.9009688679 |
|---|---|
| 5-th percentile | -0.9009688679 |
| Q1 | -0.9009688679 |
| median | -0.222520934 |
| Q3 | 0.6234898019 |
| 95-th percentile | 1 |
| Maximum | 1 |
| Range | 1.900968868 |
| Interquartile range (IQR) | 1.52445867 |
Descriptive statistics
| Standard deviation | 0.7079619739 |
|---|---|
| Coefficient of variation (CV) | -186.5230498 |
| Kurtosis | -1.503349059 |
| Mean | -0.003795573655 |
| Median Absolute Deviation (MAD) | 0.6408877408 |
| Skewness | 0.009053080122 |
| Sum | -1.123489802 |
| Variance | 0.5012101565 |
| Value | Count | Frequency (%) | |
| -0.222520934 | 43 | 14.5% | |
| -0.9009688679 | 43 | 14.5% | |
| -0.222520934 | 42 | 14.2% | |
| -0.9009688679 | 42 | 14.2% | |
| 0.6234898019 | 42 | 14.2% | |
| 1 | 42 | 14.2% | |
| 0.6234898019 | 42 | 14.2% |
| Value | Count | Frequency (%) | |
| -0.9009688679 | 42 | 14.2% | |
| -0.9009688679 | 43 | 14.5% | |
| -0.222520934 | 42 | 14.2% | |
| -0.222520934 | 43 | 14.5% | |
| 0.6234898019 | 42 | 14.2% |
| Value | Count | Frequency (%) | |
| 1 | 42 | 14.2% | |
| 0.6234898019 | 42 | 14.2% | |
| 0.6234898019 | 42 | 14.2% | |
| -0.222520934 | 43 | 14.5% | |
| -0.222520934 | 42 | 14.2% |
is_august
Boolean
| Distinct count | 2 |
|---|---|
| Unique (%) | 0.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 2.4 KiB |
| 0 | |
|---|---|
| 1 | 31 |
| Value | Count | Frequency (%) | |
| 0 | 265 | 89.5% | |
| 1 | 31 | 10.5% |
spring
Boolean
| Distinct count | 2 |
|---|---|
| Unique (%) | 0.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 2.4 KiB |
| 0 | |
|---|---|
| 1 | 5 |
| Value | Count | Frequency (%) | |
| 0 | 291 | 98.3% | |
| 1 | 5 | 1.7% |
summer
Boolean
| Distinct count | 2 |
|---|---|
| Unique (%) | 0.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 2.4 KiB |
| 0 | |
|---|---|
| 1 |
| Value | Count | Frequency (%) | |
| 0 | 188 | 63.5% | |
| 1 | 108 | 36.5% |
autumn
Boolean
| Distinct count | 2 |
|---|---|
| Unique (%) | 0.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 2.4 KiB |
| 0 | |
|---|---|
| 1 |
| Value | Count | Frequency (%) | |
| 0 | 206 | 69.6% | |
| 1 | 90 | 30.4% |
winter
Boolean
| Distinct count | 2 |
|---|---|
| Unique (%) | 0.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 2.4 KiB |
| 0 | |
|---|---|
| 1 |
| Value | Count | Frequency (%) | |
| 0 | 200 | 67.6% | |
| 1 | 96 | 32.4% |
stockMissingType
Categorical
| Distinct count | 3 |
|---|---|
| Unique (%) | 1.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 2.4 KiB |
| 0 | |
|---|---|
| 2 | |
| 1 | 11 |
| Value | Count | Frequency (%) | |
| 0 | 203 | 68.6% | |
| 2 | 82 | 27.7% | |
| 1 | 11 | 3.7% |
Length
| Max length | 3 |
|---|---|
| Mean length | 3 |
| Min length | 3 |
| Value | Count | Frequency (%) | |
| Decimal_Number | 3 | 75.0% | |
| Other_Punctuation | 1 | 25.0% |
| Value | Count | Frequency (%) | |
| Common | 4 | 100.0% |
| Value | Count | Frequency (%) | |
| ASCII | 4 | 100.0% |
| Distinct count | 241 |
|---|---|
| Unique (%) | 98.0% |
| Missing | 50 |
| Missing (%) | 16.9% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 590.0440911730547 |
|---|---|
| Minimum | 266.14285714285717 |
| Maximum | 1055.625 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 2.4 KiB |
Quantile statistics
| Minimum | 266.1428571 |
|---|---|
| 5-th percentile | 332.28125 |
| Q1 | 462.5 |
| median | 594.3125 |
| Q3 | 702.8125 |
| 95-th percentile | 854.9071429 |
| Maximum | 1055.625 |
| Range | 789.4821429 |
| Interquartile range (IQR) | 240.3125 |
Descriptive statistics
| Standard deviation | 164.3824352 |
|---|---|
| Coefficient of variation (CV) | 0.2785934774 |
| Kurtosis | -0.5182699875 |
| Mean | 590.0440912 |
| Median Absolute Deviation (MAD) | 135.4574087 |
| Skewness | 0.188774699 |
| Sum | 145150.8464 |
| Variance | 27021.58499 |
| Value | Count | Frequency (%) | |
| 477.375 | 2 | 0.7% | |
| 554.75 | 2 | 0.7% | |
| 605.5 | 2 | 0.7% | |
| 829 | 2 | 0.7% | |
| 421.875 | 2 | 0.7% | |
| 687 | 1 | 0.3% | |
| 511.375 | 1 | 0.3% | |
| 532.375 | 1 | 0.3% | |
| 790.25 | 1 | 0.3% | |
| 803.75 | 1 | 0.3% | |
| Other values (231) | 231 | 78.0% | |
| (Missing) | 50 | 16.9% |
| Value | Count | Frequency (%) | |
| 266.1428571 | 1 | 0.3% | |
| 267.2857143 | 1 | 0.3% | |
| 274.5 | 1 | 0.3% | |
| 280.875 | 1 | 0.3% | |
| 290.25 | 1 | 0.3% |
| Value | Count | Frequency (%) | |
| 1055.625 | 1 | 0.3% | |
| 1036.75 | 1 | 0.3% | |
| 975.5714286 | 1 | 0.3% | |
| 941.5 | 1 | 0.3% | |
| 926.1428571 | 1 | 0.3% |
| Distinct count | 6 |
|---|---|
| Unique (%) | 2.4% |
| Missing | 42 |
| Missing (%) | 14.2% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 602.9495242118015 |
|---|---|
| Minimum | 459.9736842105263 |
| Maximum | 754.6 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 2.4 KiB |
Quantile statistics
| Minimum | 459.9736842 |
|---|---|
| 5-th percentile | 459.9736842 |
| Q1 | 496.475 |
| median | 635.3684211 |
| Q3 | 745.8974359 |
| 95-th percentile | 754.6 |
| Maximum | 754.6 |
| Range | 294.6263158 |
| Interquartile range (IQR) | 249.4224359 |
Descriptive statistics
| Standard deviation | 117.7709292 |
|---|---|
| Coefficient of variation (CV) | 0.1953246905 |
| Kurtosis | -1.66477379 |
| Mean | 602.9495242 |
| Median Absolute Deviation (MAD) | 109.5968135 |
| Skewness | 0.1891370611 |
| Sum | 153149.1791 |
| Variance | 13869.99177 |
| Value | Count | Frequency (%) | |
| 635.3684211 | 43 | 14.5% | |
| 754.6 | 43 | 14.5% | |
| 459.9736842 | 42 | 14.2% | |
| 521 | 42 | 14.2% | |
| 745.8974359 | 42 | 14.2% | |
| 496.475 | 42 | 14.2% | |
| (Missing) | 42 | 14.2% |
| Value | Count | Frequency (%) | |
| 459.9736842 | 42 | 14.2% | |
| 496.475 | 42 | 14.2% | |
| 521 | 42 | 14.2% | |
| 635.3684211 | 43 | 14.5% | |
| 745.8974359 | 42 | 14.2% |
| Value | Count | Frequency (%) | |
| 754.6 | 43 | 14.5% | |
| 745.8974359 | 42 | 14.2% | |
| 635.3684211 | 43 | 14.5% | |
| 521 | 42 | 14.2% | |
| 496.475 | 42 | 14.2% |
| Distinct count | 241 |
|---|---|
| Unique (%) | 86.1% |
| Missing | 16 |
| Missing (%) | 5.4% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 934.1020280612246 |
|---|---|
| Minimum | 348.5 |
| Maximum | 2132.0 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 2.4 KiB |
Quantile statistics
| Minimum | 348.5 |
|---|---|
| 5-th percentile | 504.75 |
| Q1 | 746 |
| median | 892.5 |
| Q3 | 1112.40625 |
| 95-th percentile | 1413.87 |
| Maximum | 2132 |
| Range | 1783.5 |
| Interquartile range (IQR) | 366.40625 |
Descriptive statistics
| Standard deviation | 288.0628119 |
|---|---|
| Coefficient of variation (CV) | 0.3083847409 |
| Kurtosis | 1.052298923 |
| Mean | 934.1020281 |
| Median Absolute Deviation (MAD) | 223.311589 |
| Skewness | 0.7585446128 |
| Sum | 261548.5679 |
| Variance | 82980.1836 |
| Value | Count | Frequency (%) | |
| 746 | 6 | 2.0% | |
| 523 | 4 | 1.4% | |
| 814 | 3 | 1.0% | |
| 691.6 | 2 | 0.7% | |
| 615 | 2 | 0.7% | |
| 1139.5 | 2 | 0.7% | |
| 856.5714286 | 2 | 0.7% | |
| 940 | 2 | 0.7% | |
| 1085 | 2 | 0.7% | |
| 499.875 | 2 | 0.7% | |
| Other values (231) | 253 | 85.5% | |
| (Missing) | 16 | 5.4% |
| Value | Count | Frequency (%) | |
| 348.5 | 1 | 0.3% | |
| 412.4 | 1 | 0.3% | |
| 432.7142857 | 1 | 0.3% | |
| 450 | 1 | 0.3% | |
| 458.625 | 2 | 0.7% |
| Value | Count | Frequency (%) | |
| 2132 | 1 | 0.3% | |
| 1892 | 1 | 0.3% | |
| 1763.4 | 1 | 0.3% | |
| 1726.8 | 1 | 0.3% | |
| 1718.25 | 1 | 0.3% |
meanwd_udsstock
Real number (ℝ≥0)
| Distinct count | 7 |
|---|---|
| Unique (%) | 2.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 907.8800252711544 |
|---|---|
| Minimum | 684.6923076923077 |
| Maximum | 1157.5714285714287 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 2.4 KiB |
Quantile statistics
| Minimum | 684.6923077 |
|---|---|
| 5-th percentile | 684.6923077 |
| Q1 | 706.1818182 |
| median | 929 |
| Q3 | 1112.935484 |
| 95-th percentile | 1157.571429 |
| Maximum | 1157.571429 |
| Range | 472.8791209 |
| Interquartile range (IQR) | 406.7536657 |
Descriptive statistics
| Standard deviation | 174.5641689 |
|---|---|
| Coefficient of variation (CV) | 0.192276693 |
| Kurtosis | -1.459526887 |
| Mean | 907.8800253 |
| Median Absolute Deviation (MAD) | 154.0647922 |
| Skewness | 0.09880003048 |
| Sum | 268732.4875 |
| Variance | 30472.64906 |
| Value | Count | Frequency (%) | |
| 1112.935484 | 43 | 14.5% | |
| 968.5806452 | 43 | 14.5% | |
| 706.1818182 | 42 | 14.2% | |
| 684.6923077 | 42 | 14.2% | |
| 789.8709677 | 42 | 14.2% | |
| 1157.571429 | 42 | 14.2% | |
| 929 | 42 | 14.2% |
| Value | Count | Frequency (%) | |
| 684.6923077 | 42 | 14.2% | |
| 706.1818182 | 42 | 14.2% | |
| 789.8709677 | 42 | 14.2% | |
| 929 | 42 | 14.2% | |
| 968.5806452 | 43 | 14.5% |
| Value | Count | Frequency (%) | |
| 1157.571429 | 42 | 14.2% | |
| 1112.935484 | 43 | 14.5% | |
| 968.5806452 | 43 | 14.5% | |
| 929 | 42 | 14.2% | |
| 789.8709677 | 42 | 14.2% |
| Distinct count | 229 |
|---|---|
| Unique (%) | 99.1% |
| Missing | 65 |
| Missing (%) | 22.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3042.899860853432 |
|---|---|
| Minimum | 51.0 |
| Maximum | 15296.0 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 2.4 KiB |
Quantile statistics
| Minimum | 51 |
|---|---|
| 5-th percentile | 243.1428571 |
| Q1 | 1504.232143 |
| median | 2543.375 |
| Q3 | 3942.8125 |
| 95-th percentile | 7108.625 |
| Maximum | 15296 |
| Range | 15245 |
| Interquartile range (IQR) | 2438.580357 |
Descriptive statistics
| Standard deviation | 2292.257982 |
|---|---|
| Coefficient of variation (CV) | 0.7533136437 |
| Kurtosis | 5.677154598 |
| Mean | 3042.899861 |
| Median Absolute Deviation (MAD) | 1633.3131 |
| Skewness | 1.89050142 |
| Sum | 702909.8679 |
| Variance | 5254446.654 |
| Value | Count | Frequency (%) | |
| 162 | 2 | 0.7% | |
| 97 | 2 | 0.7% | |
| 5313 | 1 | 0.3% | |
| 4051.75 | 1 | 0.3% | |
| 2522.375 | 1 | 0.3% | |
| 2000.2 | 1 | 0.3% | |
| 3365.6 | 1 | 0.3% | |
| 1568.25 | 1 | 0.3% | |
| 803.25 | 1 | 0.3% | |
| 2232.625 | 1 | 0.3% | |
| Other values (219) | 219 | 74.0% | |
| (Missing) | 65 | 22.0% |
| Value | Count | Frequency (%) | |
| 51 | 1 | 0.3% | |
| 89 | 1 | 0.3% | |
| 95 | 1 | 0.3% | |
| 97 | 2 | 0.7% | |
| 115.5714286 | 1 | 0.3% |
| Value | Count | Frequency (%) | |
| 15296 | 1 | 0.3% | |
| 12505 | 1 | 0.3% | |
| 12121.75 | 1 | 0.3% | |
| 11678 | 1 | 0.3% | |
| 10357.25 | 1 | 0.3% |
| Distinct count | 7 |
|---|---|
| Unique (%) | 2.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2442.9691075875285 |
|---|---|
| Minimum | 162.0 |
| Maximum | 4885.578947368421 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 2.4 KiB |
Quantile statistics
| Minimum | 162 |
|---|---|
| 5-th percentile | 162 |
| Q1 | 838.7272727 |
| median | 2197.815789 |
| Q3 | 3966.051282 |
| 95-th percentile | 4885.578947 |
| Maximum | 4885.578947 |
| Range | 4723.578947 |
| Interquartile range (IQR) | 3127.324009 |
Descriptive statistics
| Standard deviation | 1534.445848 |
|---|---|
| Coefficient of variation (CV) | 0.6281069388 |
| Kurtosis | -1.068417295 |
| Mean | 2442.969108 |
| Median Absolute Deviation (MAD) | 1273.828246 |
| Skewness | 0.08902798115 |
| Sum | 723118.8558 |
| Variance | 2354524.059 |
| Value | Count | Frequency (%) | |
| 2918.421053 | 43 | 14.5% | |
| 3966.051282 | 43 | 14.5% | |
| 2084.605263 | 42 | 14.2% | |
| 4885.578947 | 42 | 14.2% | |
| 838.7272727 | 42 | 14.2% | |
| 2197.815789 | 42 | 14.2% | |
| 162 | 42 | 14.2% |
| Value | Count | Frequency (%) | |
| 162 | 42 | 14.2% | |
| 838.7272727 | 42 | 14.2% | |
| 2084.605263 | 42 | 14.2% | |
| 2197.815789 | 42 | 14.2% | |
| 2918.421053 | 43 | 14.5% |
| Value | Count | Frequency (%) | |
| 4885.578947 | 42 | 14.2% | |
| 3966.051282 | 43 | 14.5% | |
| 2918.421053 | 43 | 14.5% | |
| 2197.815789 | 42 | 14.2% | |
| 2084.605263 | 42 | 14.2% |
Pearson's r
The Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. It's value lies between -1 and +1, -1 indicating total negative linear correlation, 0 indicating no linear correlation and 1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r.To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
Spearman's ρ
The Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better in catching nonlinear monotonic correlations than Pearson's r. It's value lies between -1 and +1, -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation and 1 indicating total positive monotonic correlation.To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
Kendall's τ
Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. It's value lies between -1 and +1, -1 indicating total negative correlation, 0 indicating no correlation and 1 indicating total positive correlation.To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the discordant pairs divided by the total number of pairs.
First rows
| df_index | fecha | producto | udsstock | udsventa | udsprevisionempresa | promo | festivo | weekday | quarter | month | weekofyear | working_day | sin_weekday | cos_weekday | is_august | spring | summer | autumn | winter | stockMissingType | roll4wd_udsventa | meanwd_udsventa | roll4wd_udsstock | meanwd_udsstock | roll4wd_udsprevisionempresa | meanwd_udsprevisionempresa | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 35 | 2019-06-05 | 48 | 494.0 | 664.0 | 11678.0 | 0.0 | 0.0 | 2 | 2 | 6 | 23 | True | 0.974928 | -0.222521 | 0 | 0 | 1 | 0 | 0 | 0.0 | 664.00 | 635.368421 | 494.0 | 968.580645 | 11678.00 | 2918.421053 |
| 1 | 107 | 2019-06-06 | 48 | NaN | 730.0 | 15296.0 | 0.0 | 0.0 | 3 | 2 | 6 | 23 | True | 0.433884 | -0.900969 | 0 | 0 | 1 | 0 | 0 | 2.0 | 730.00 | 754.600000 | NaN | 1112.935484 | 15296.00 | 3966.051282 |
| 2 | 179 | 2019-06-07 | 48 | NaN | 575.0 | 12505.0 | 0.0 | 0.0 | 4 | 2 | 6 | 23 | True | -0.433884 | -0.900969 | 0 | 0 | 1 | 0 | 0 | 2.0 | 575.00 | 745.897436 | NaN | 929.000000 | 12505.00 | 4885.578947 |
| 3 | 251 | 2019-06-08 | 48 | NaN | 619.0 | 2841.0 | 0.0 | 0.0 | 5 | 2 | 6 | 23 | True | -0.974928 | -0.222521 | 0 | 0 | 1 | 0 | 0 | 2.0 | 619.00 | 459.973684 | NaN | 1157.571429 | 2841.00 | 838.727273 |
| 4 | 323 | 2019-06-09 | 48 | NaN | NaN | NaN | 0.0 | 0.0 | 6 | 2 | 6 | 23 | False | -0.781831 | 0.623490 | 0 | 0 | 1 | 0 | 0 | 2.0 | NaN | NaN | NaN | 706.181818 | NaN | 162.000000 |
| 5 | 395 | 2019-06-10 | 48 | NaN | 398.0 | 4219.0 | 0.0 | 0.0 | 0 | 2 | 6 | 24 | True | 0.000000 | 1.000000 | 0 | 0 | 1 | 0 | 0 | 2.0 | 398.00 | 521.000000 | NaN | 684.692308 | 4219.00 | 2197.815789 |
| 6 | 467 | 2019-06-11 | 48 | 882.0 | 405.0 | 3498.0 | 0.0 | 0.0 | 1 | 2 | 6 | 24 | True | 0.781831 | 0.623490 | 0 | 0 | 1 | 0 | 0 | 0.0 | 405.00 | 496.475000 | 882.0 | 789.870968 | 3498.00 | 2084.605263 |
| 7 | 539 | 2019-06-12 | 48 | 1366.0 | 560.0 | 1594.0 | 0.0 | 0.0 | 2 | 2 | 6 | 24 | True | 0.974928 | -0.222521 | 0 | 0 | 1 | 0 | 0 | 0.0 | 638.00 | 635.368421 | 712.0 | 968.580645 | 9157.00 | 2918.421053 |
| 8 | 611 | 2019-06-13 | 48 | 1560.0 | 797.0 | 2599.0 | 0.0 | 0.0 | 3 | 2 | 6 | 24 | True | 0.433884 | -0.900969 | 0 | 0 | 1 | 0 | 0 | 0.0 | 746.75 | 754.600000 | 1560.0 | 1112.935484 | 12121.75 | 3966.051282 |
| 9 | 683 | 2019-06-14 | 48 | 843.0 | 826.0 | 3914.0 | 0.0 | 0.0 | 4 | 2 | 6 | 24 | True | -0.433884 | -0.900969 | 0 | 0 | 1 | 0 | 0 | 0.0 | 637.75 | 745.897436 | 843.0 | 929.000000 | 10357.25 | 4885.578947 |
Last rows
| df_index | fecha | producto | udsstock | udsventa | udsprevisionempresa | promo | festivo | weekday | quarter | month | weekofyear | working_day | sin_weekday | cos_weekday | is_august | spring | summer | autumn | winter | stockMissingType | roll4wd_udsventa | meanwd_udsventa | roll4wd_udsstock | meanwd_udsstock | roll4wd_udsprevisionempresa | meanwd_udsprevisionempresa | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 286 | 20627 | 2020-03-17 | 48 | NaN | 1719.0 | 1757.0 | 0.0 | 0.0 | 1 | 1 | 3 | 12 | True | 0.781831 | 0.623490 | 0 | 0 | 0 | 0 | 1 | 2.0 | 543.500000 | 496.475000 | 926.714286 | 789.870968 | 2804.875 | 2084.605263 |
| 287 | 20699 | 2020-03-18 | 48 | 571.0 | 605.0 | 3145.0 | 0.0 | 0.0 | 2 | 1 | 3 | 12 | True | 0.974928 | -0.222521 | 0 | 0 | 0 | 0 | 1 | 0.0 | 673.750000 | 635.368421 | 1449.400000 | 968.580645 | 3949.000 | 2918.421053 |
| 288 | 20771 | 2020-03-19 | 48 | 349.0 | 1586.0 | 3574.0 | 0.0 | 0.0 | 3 | 1 | 3 | 12 | True | 0.433884 | -0.900969 | 0 | 0 | 0 | 0 | 1 | 0.0 | 696.875000 | 754.600000 | 707.500000 | 1112.935484 | 5210.750 | 3966.051282 |
| 289 | 20843 | 2020-03-20 | 48 | 48.0 | 730.0 | 3505.0 | 0.0 | 0.0 | 4 | 1 | 3 | 12 | True | -0.433884 | -0.900969 | 0 | 0 | 0 | 0 | 1 | 0.0 | 1036.750000 | 745.897436 | 348.500000 | 929.000000 | 7273.500 | 4885.578947 |
| 290 | 20915 | 2020-03-21 | 48 | 1473.0 | 1542.0 | NaN | 0.0 | 0.0 | 5 | 1 | 3 | 12 | True | -0.974928 | -0.222521 | 0 | 0 | 0 | 0 | 1 | 0.0 | 1055.625000 | 459.973684 | 1141.200000 | 1157.571429 | NaN | 838.727273 |
| 291 | 20987 | 2020-03-22 | 48 | 290.0 | NaN | NaN | 0.0 | 0.0 | 6 | 1 | 3 | 12 | False | -0.781831 | 0.623490 | 0 | 1 | 0 | 0 | 1 | 0.0 | NaN | NaN | 615.000000 | 706.181818 | NaN | 162.000000 |
| 292 | 21059 | 2020-03-23 | 48 | 290.0 | NaN | 1476.0 | 0.0 | 0.0 | 0 | 1 | 3 | 13 | True | 0.000000 | 1.000000 | 0 | 1 | 0 | 0 | 1 | 0.0 | 423.285714 | 521.000000 | 615.000000 | 684.692308 | 2524.375 | 2197.815789 |
| 293 | 21131 | 2020-03-24 | 48 | 118.0 | NaN | 429.0 | 0.0 | 0.0 | 1 | 1 | 3 | 13 | True | 0.781831 | 0.623490 | 0 | 1 | 0 | 0 | 1 | 0.0 | 975.571429 | 496.475000 | 803.600000 | 789.870968 | 2150.375 | 2084.605263 |
| 294 | 21203 | 2020-03-25 | 48 | 140.0 | NaN | 1842.0 | 0.0 | 0.0 | 2 | 1 | 3 | 13 | True | 0.974928 | -0.222521 | 0 | 1 | 0 | 0 | 1 | 0.0 | 650.142857 | 635.368421 | 769.800000 | 968.580645 | 3393.250 | 2918.421053 |
| 295 | 21275 | 2020-03-26 | 48 | 753.0 | NaN | 802.0 | 0.0 | 0.0 | 3 | 1 | 3 | 13 | True | 0.433884 | -0.900969 | 0 | 1 | 0 | 0 | 1 | 0.0 | 926.142857 | 754.600000 | 450.000000 | 1112.935484 | 4042.375 | 3966.051282 |